Maintaining mirror and storage system copies of volumes at multiple remote sites

ABSTRACT

Provided are a computer program product, system, and method for maintaining mirror and storage system copies of volumes at multiple remote sites. A first server maintains a mirror copy relationship in a computer readable storage medium between a first storage system at a first site and a second storage system at a second site to mirror data at the first storage system to the second storage system, wherein the first and second sites connect over a network. The first server performs a first point-in-time copy operation from the first storage system to a first storage system copy, wherein the data for the first storage system copy is consistent as of the determined point-in-time, wherein the point-in-time copy operation is completed upon creating a data structure indicating data for the first storage system copy as located at the first storage or the first storage system copy. The first server transmits a command to a second server to create a point-in-time copy of the second storage system. The second server processes mirror data transferred from the first server as part of the mirror copy relationship to determine when to create a second point-in-time copy from the second storage system to a second storage system copy in response to the command. The second server performs the second point-in-time copy operation, in response to determining from the mirror data to create the second point-in-time copy.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a method, system, and computer programproduct maintaining mirror and storage system copies of volumes atmultiple remote sites.

2. Description of the Related Art

Disaster recovery systems typically address two types of failures, asudden catastrophic failure at a single point in time or data loss overa period of time. In the second type of gradual disaster, updates tovolumes may be lost. To assist in recovery of data updates, a copy ofdata may be provided at a remote location. Such dual or shadow copiesare typically made as the application system is writing new data to aprimary storage device. Different copy technologies may be used formaintaining remote copies of data at a secondary site, such asInternational Business Machine Corporation's (“IBM”) Extended RemoteCopy (XRC), Peer-to-Peer Remote Copy (PRRC), Global Copy, and GlobalMirror Copy. Geographically Dispersed Parallel Sysplex (GDPS™) is anend-to-end Resource Management automated solution that manages storagebased data replication solutions, such as those mentioned above, as wellas other resources such as servers and workloads. (GDPS is a registeredtrademark of IBM).

In data mirroring systems, such as PPRC and XRC, data is maintained involume pairs in a consistency group. A volume pair is comprised of avolume in a primary storage device and a corresponding volume in asecondary storage device that includes an identical copy of the datamaintained in the primary volume. Volumes in the primary and secondarystorages are consistent when all writes have been transferred in theirlogical order, i.e., all dependent writes transferred first before thewrites dependent thereon. A consistency group has a consistency time forall data writes in a consistency group having a time stamp equal orearlier than the consistency time stamp. A consistency group is acollection of updates to the primary volumes such that dependent writesare secured in a consistent manner. The consistency time is the latesttime to which the system guarantees that updates to the secondaryvolumes are consistent. Consistency groups maintain data consistencyacross volumes and storage devices. Thus, when data is recovered fromthe secondary volumes, the recovered data will be consistent. In certainbackup systems, a sysplex timer is used to provide a uniform time acrosssystems so that updates written by different applications to differentprimary storage devices use consistent time-of-day (TOD) values as timestamps. Application systems time stamp data sets when writing such datasets to volumes in the primary storage. The time stamp determines thelogical sequence of data updates.

To provide backup of data, a production volume may be mirrored to abackup production volume in the same local site by performing a mirrorcopy operation. To provide protection from a disaster, the data may alsobe mirrored to a geographically remote location. In this way, if theprimary production volume fails, then the system can failover to usingthe backup production volume at the local site. However, if both localsites fail or experience a disaster, then production can switch over tothe remote site mirror copy. Further, a production site may mirror datato multiple sites, including another local site and a remote site. Thelocal site may further send a directive to the remote site to make apoint-in-time copy of the remote site mirror copy. The local site mayreceive a directive to perform a point-in-time copy operation andforward that directive to the remote site to create a remotepoint-in-time copy from the remote mirror copy of the data.

SUMMARY

Provided are a computer program product, system, and method formaintaining mirror and storage copies of volumes at multiple remotesites. A first server maintains a mirror copy relationship in a computerreadable storage medium between a first storage system at a first siteand a second storage system at a second site to mirror data at the firststorage system to the second storage system, wherein the first andsecond sites connect over a network. The first server performs a firstpoint-in-time copy operation from the first storage system to a firststorage system copy, wherein the data for the first storage system copyis consistent as of the determined point-in-time, wherein thepoint-in-time copy operation is completed upon creating a data structureindicating data for the first storage system copy as located at thefirst storage system or the first storage system copy. The first servertransmits a command to a second server to create a point-in-time copy ofthe second storage system. The second server processes mirror datatransferred from the first server as part of the mirror copyrelationship to determine when to create a second point-in-time copyfrom the second storage system to a second storage system copy inresponse to the command. The second server performs the secondpoint-in-time copy operation, in response to determining from the mirrordata to create the second point-in-time copy.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an embodiment of a network storage environment.

FIG. 2 illustrates an embodiment of a server.

FIG. 3 illustrates an embodiment of a point-in-time command.

FIG. 4 illustrates an embodiment of operations to maintain mirror copiesof a storage at multiple sites.

FIGS. 5 and 6 illustrates an embodiment of operations to maintainpoint-in-time copies of a storage at multiple sites.

FIG. 7 illustrates an embodiment of operations to recover a storagesystem from a remote storage site.

DETAILED DESCRIPTION

Described embodiments provide a system for maintaining duplicate, suchas mirror, and point-in-time copies of a first storage system atmultiple storage sites to allow recovery from one of the multiplestorage sites. A first storage server maintains duplicate or copies ofvolumes in a first storage system at a first site at a second and thirdstorage systems at a second and third sites. Further, point-in-timecopies as of a same point-in-time of the data in the first storagesystem are created at the first, second and third sites before an eventis initiated where operations are performed on the data at the firststorage system. Creating the multiple point-in-time copies allowsrecovery of the data at the first storage system from the first, secondor third sites in the event of a failure.

A point-in-time copy replicates data in a manner that appearsinstantaneous and allows a host to continue accessing the source volumewhile actual data transfers to the copy volume are deferred to a latertime. The point-in-time copy appears instantaneous because complete isreturned to the copy operation in response to generating therelationship data structures without copying the data. Point-in-timecopy techniques, such as the IBM FlashCopy® (FlashCopy is a registeredtrademark of International Business Machines, Corp. or “IBM”), typicallydefer the transfer of a data back to the copy volume until a writeoperation is requested to that data block on the source volume. Datatransfers may also proceed as a background process with minimal impacton system performance. Until the actual data transfer occurs, reads aredirected to the data blocks on the source volume. The point-in-time copyrelationships that are immediately established in response to thepoint-in-time copy command include a bitmap or other data structureindicating the location of blocks in the volume at either the sourcevolume or the copy volume.

FIG. 1 illustrates an embodiment of a network storage environment. Theenvironment includes a first 2 a, second 2 b, and third 2 c sites thatare in different geographical locations. For instance, the first 2 a andthird 2 c sites may be in proximate locations, such as in the samebuilding or facility or in a same local region. The second site 2 b maybe more remote with respect to the first 2 a and third 2 c sites, suchas in a different geographical region, different state, time zone etc.In this way the second site 2 b may provide for data recovery if thereis a failure or disaster involving the first 2 a and third 3 b sites.

Each site 2 a, 2 b, 2 c includes a server 4 a, 4 b, 4 c and a storagesystem 6 a, 6 b, 6 c. Each storage system 6 a, 6 b, 6 c includes one ormore volumes of data. The servers 4 a, 4 b, 4 c may perform apoint-in-time copy operation to create a point-in-time copy of volumesin the storage systems 6 a, 6 b, 6 c resulting in a first storage systemcopy 8 a, second storage system copy (I) 8 b, and third storage systemcopy 8 c, respectively. The point-in-time copies 8 a, 8 b, 8 c may havea subset of the volumes in the storage systems 6 a, 6 b, 6 c. The secondserver 4 b may further create an additional second storage system copyII 10 that is a point-in-time copy of the second storage system 6 b. Apoint-in-time copy is established and completed when relationship datastructures are created and before any data is copied from the sourcestorage system to the secondary or target storage system. In this way, apoint-in-time copy is created almost instantly. Data in a point-in-timecopy may be copied over time from the source to target copy when thesource data is updated or copied on a deferred basis. The servers 4 a, 4b, 4 c represent both host and storage server components at the sites 2a, 2 b, 2 c. For instance, the host server component may executeapplication programs and initiate jobs with respect to storage system 6a, 6 b, 6 c and the storage server component may manage I/O operationsand copy relationships with respect to the storage systems 6 a, 6 b, 6c.

The first 4 a, second 4 b, and third 4 c servers may communicate over anetwork 14. The network 14 may comprise a Local Area Network (LAN),Storage Area Network (SAN), Wide Area Network (WAN), wireless network,etc. The servers 4 a, 4 b, 4 c may comprise an enterprise storageserver, storage controller, blade server, general purpose server,desktop computer, workstation, telephony device, personal digitalassistant (PDA), etc., or other device used to manage I/O requests toattached storage systems 6 a, 6 b, 6 c.

The storage systems 6 a, 6 b, 6 c and storage system copies 8 a, 8 b, 8c, 10 may each comprise storage media implemented in one or more storagedevices known in the art, such as interconnected hard disk drives (e.g.,configured as a DASD, RAID, JBOD, etc.), magnetic tape, solid statestorage devices (e.g, EEPROM (Electrically Erasable ProgrammableRead-Only Memory), flash memory, flash disk, storage-class memory(SCM)), electronic memory, etc. The storage systems 6 a, 6 b, 6 c, 8 a,8 b, 8 c, 10 may be implemented in a distributed storage environment ornetwork storage environment, such as “cloud” storage. The volumesconfigured in the storage systems may comprise a logical arrangement oftracks or blocks of data.

FIG. 2 illustrates an embodiment of a server 4, such as the servers 4 a,4 b, and 4 c, comprised of a host server 18 a and storage server 18 b,each having a processor 20 a, 20 b and a memory 22 a, 22 b including anoperating system 24 a, 24 b. The memories 22 a and 22 b may comprise oneor more volatile or non-volatile memory devices.

The storage server 18 b includes a copy manager 26 to perform datamirroring and create point-in-time copies, a recovery module 28 tohandle a failure at one of the sites 2 a, 2 b, 2 c, point-in-time copyrelationships 30 providing information on point-in-time copies createdby the servers 4 a, 4 b, 4 c, and mirror copy relationship information34. The point-in-time copy relationships 30 indicate the source volumes6 a, 6 b, 6 c subject to a point-in-time copy relationship, the targetvolumes including the point-in-time copy, e.g., 8 a, 8 b, 8 c, 10, and abitmap or other data structure indicating the location of thepoint-in-time copy either at the source volume, e.g., 6 a, 6 b, 6 c, ortarget volume, e.g., 8 a, 8 b, 8 c, 10.

The mirror copy relationship information 34 indicates mirror copies fromthe first storage system 6 a to the second storage system 6 b and thirdstorage system 6 c. In one embodiment, where the third site 2 c is moregeographically proximate to the first site 2 a than the second site 2 b,the mirror copy operations from the first storage system 6 a to the moreremote second storage system 6 b may be asynchronous, where complete toan update request to the first storage system 6 a is returned inresponse to applying the update to the first storage system 6 a, butbefore the update is mirrored to the second storage system 6 b, suchthat the update is mirrored at a later time. The mirror copy operationsfrom the first storage system 6 a to the third storage system 6 b may besynchronous, such that an update to the first storage system 6 a is notcomplete until the update is also applied to the second storage system 6b.

The copy manager 26 and recovery module 28 may be implemented as one ormore software programs loaded into the memory 22 and executed by theprocessor 20. In an alternative embodiment, the copy manager 26 andrecovery module 28 may be implemented with hardware logic, such as anApplication Specific Integrated Circuit (ASIC), or as a programmableprocessor executing code in a computer readable storage medium.

The host server 18 a may include a job scheduler 32 to initiate anoperation for an event, such as a batch job, on a subset of volumes ofthe first storage system 6 a volumes and applications 36 that submit I/Orequests to the storage systems 6 a, 6 b, 6 c. For instance, the eventmay comprise a batch job operation that applies updates to a productionsite, a backup operation, defragmentation operation, etc. The jobscheduler 32 may comprise a task manager or any other type of programfor submitting operations and I/O requests.

The copy manager 26 code in the first 4 a and third 4 b servers managingthe mirror copy relationship therebetween may be implemented usingsynchronous copy operations, such as a peer-to-peer remote copy (PPRC)program. An example of a PPRC program is the IBM GeographicallyDispersed Parallel Sysplex (GDPS)/PPRC copy program that enables theswitch of updates from the first storage system 6 a to the third storagesystem 6 c. The copy manager 26 implemented in the first server 4 a andthe second server 4 b may implement asynchronous remote copy operations.An example of an asynchronous remote copy program is the IBM GDPS/XRCprogram where updates to the first storage system 6 a is mirrored to thesecond storage system 6 b. The described operations may be implementedwith other global recovery programs.

In further embodiments, programs or components described as included inthe host server 18 a may be implemented in the storage server 18 b, andvice versa. Further, the components described in the host server 18 a,and storage server 18 b may be implemented in a single server.

FIG. 3 illustrates an embodiment of a point-in-time copy command 50 thefirst server 4 a sends to the second server 4 b to create apoint-in-time copy of the second storage system 6 a. The point-in-timecopy command 50 may include a point-in-time copy operand 52, the volumesto copy 54 at the target site and optionally a timestamp 56 at which thedata in the point-in-time copy is to be consistent.

In one embodiment, the first storage system 6 a comprises a productionsite and the first server 4 a receives I/O requests for the firststorage system 6 a from remote hosts that connect over the network 14 orfrom applications within the host server 18 a of the first server 4 a.The second site 2 b and third site 2 c provide backup and recoverystorage for the first storage system 6 a, so that the production sitecan be recovered from the second storage system 6 b or third storagesystem 6 c if the data at the first storage system 6 a is no longeraccessible.

FIG. 4 illustrates an embodiment of operations performed by the copymanager 26 in the first server 6 a to initiate and maintain mirror copyrelationships from the first storage system 6 a to the second 6 b andthird 6 c storages. In response to initiating (at block 100) mirror copyoperations, the copy manager 26 copies (at block 102) mirror data fromthe first storage system 6 a to the second storage system 6 b to mirrordata at the first storage system 6 a as part of a mirror copyrelationship 34. The copy manager 26 copies (at block 104) mirror datafrom the first storage system 6 a to the third storage system 6 c tomirror data at the first storage system 6 a as part of a mirror copyrelationship 34. As discussed, the mirror operation to the more remotesecond site 2 b may be asynchronous and the mirror operation to thecloser third site 2 c may be synchronous.

FIGS. 5 and 6 illustrate an embodiment of operations the copy manager 26program installed at the first 4 a, second 4 b, and third 4 c serversperforms to coordinate the creation of mirror and point-in-time copiesof the first storage system 6 a production site at the second 2 b andthird 2 c sites. The first server 4 a begins the copying in response todetecting an event to perform an operation, such as batch processing oranother operation, on a subset of volumes in the first storage system 6a (a subset of the production volumes). The first server 4 a quiesces(at block 202) all I/O operations of the subset of volumes in the firststorage system 6 a subject to the detected operation. In one embodiment,the first server 4 a may determine (at block 204) a point-in-time forthe backup. The first server 4 a then performs (at block 206) a firstpoint-in-time copy operation from the subset of volumes in the firststorage, such as those subject to the operation, to a first storagesystem copy 8 a as of the determined point-in-time at the same firstsite 2 a. The first server 4 a further transmits (at block 208) a firstcommand, such as command 50, to the second server 4 b to create apoint-in-time copy indicating the subset of volumes 54 to point-in-timecopy, and, in certain embodiments, the determined point in time 56 (FIG.3) The first server 4 a further transmits (at block 210) a secondcommand to the third server 4 c at the third site 2 c to create apoint-in-time copy of the subset of volumes in the third storage system6 c. In a synchronous mirroring embodiment, the first and thirdpoint-in-time copy operations may comprise an atomic operation, suchthat the first storage system 6 a point-in-time copy to the firststorage system copy 8 a is not complete until the third storage systemcopy 8 c is confirmed as complete at the third site 2 c.

With respect to FIG. 5, upon the copy manager 26 at the second server 4b receiving the first command 50 to create the point-in-time copy, thesecond server 4 b initiates (at block 232) an operation to processmirror data transferred from the first server 4 a as part of the mirrorcopy relationship to determine whether to perform a second point-in-timecopy operation from the second storage system 6 b to a second storagesystem copy 8 b. In one embodiment where the first command includes adetermined timestamp 56 and the mirrored data has a consistencytimestamp, then the second server 4 b may determine whether a timestampassociated with the mirror data set, such as a consistency group, justtransferred is equal to the timestamp 56 indicated in the command. Insuch an embodiment, the second point-in-time copy operation is performedin response to determining that the timestamp of the mirror data isequal to the timestamp 56 indicated in the command 50 and then mirroreddata with timestamps greater than the timestamp 56 can be applied to thesecond storage system 6 a after the second point-in-time copy operationis performed.

In certain embodiments, the first server 4 a asynchronously mirrors datafrom the first storage system 6 a at the first site 2 a to the secondstorage system 6 b at the site 2 b. Extended Remote Copy (XRC) or otherasynchronous mirroring technologies may be used by the first server 4 ato mirror data to the second site 2 b. In such case, the point-in-timecopy at the second site 2 b needs to occur when the data on the secondstorage system 6 b is I/O consistent to the timestamp 56 sent from thefirst site 4 a. This results in the point-in-time copy at the secondsite 2 b occurring at some real time after the first site 2 apoint-in-time copy, but at the same logical time that the data was I/Oconsistent at the first site 2 a and is later I/O consistent at thesecond site 2 b following data mirroring. For instance, the firstpoint-in-time copy at the first site 2 a occurs at time T1. The firstserver 4 a sends the timestamp 56 indicating T1 to the second server 4b, while all of the data between time T0 & T1 is still being mirrored tothe second site 2 b. When the second server 4 b observes that themirrored data received at the second site 2 b is now at time T2, thesecond server 4 b executes the T1 point-in-time copy after sendingnotification back to the first server 4 a at the first site 2 aindicating that the T1 second point-in-time copy has completed at thesecond site 2 b.

In an alternative embodiment, the processing, by the second server 4 b,of the mirror data to determine whether to create the secondpoint-in-time copy comprises determining whether a predetermined numberof sets, e.g., consistency groups, of mirror data from the first server4 a have been applied to the second storage system 6 b after receivingthe first command. The second point-in-time copy operation is performedin response to applying the predetermined number of consistency groupsto the second storage system 6 b after the receiving of the command.Applying a predetermined number of consistency groups ensures that alldata in the first storage system 6 a at the time of the quiesceoperations is mirrored to the second storage system 6 b before takingthe point-in-time copy to the second storage system copy 8 b. In thisembodiment, the command 50 may not include the timestamp 54.

If (at block 234) the determination is made to perform the secondpoint-in-time copy, then the second server 4 b performs (at block 236)the second point-in-time copy operation to copy a subset of volumes fromthe second storage system 6 b to the second storage system copy 6 c.After completing the second point-in-time copy, the second server 4 b(at block 238) sends a complete message to the first command to thefirst server 4 a. If (at block 234) a determination is made to notperform the second point-in-time copy, then control proceeds back toblock 232 to continue processing sets of mirror data from the firstserver 4 a until the condition for taking the point-in-time copy isdetected. The decision to continue processing the mirror data (from theno branch of block 234) may be performed in response to determining thatthe timestamp of the mirror data is less than the timestamp includedwith the command or in response to not applying the predetermined numberof mirror data sets after receiving the first point-in-time copycommand.

With respect to FIG. 6, the copy manager 26 in the third server 4 creceives (at block 250) the second point-in-time copy command, which maybe part of a synchronous point-in-time copy operation at the firststorage system 6 a, to create a point-in-time copy of the subset ofvolumes at the third storage system 6 b. In response to the secondpoint-in-time copy command, the third server 4 c performs (at block 252)the point-in-time copy operation to copy subset of volumes from thethird storage system 6 c to the third storage 8 c copy. The third server4 c returns (at block 254) complete to the second command.

With respect to FIG. 6, after the first server 4 a receives (at block270) complete of the second 8 b and third 8 c point-in-time copies, andcompletes the first storage system copy 8 a, the first server 4 aunquiesces (at block 272) I/O operations to the first storage to allowI/O operations to proceed against the subset of volumes in the firststorage system 6 a now that the first 2 a, second 2 b and third 2 csites have a point-in-time copies 8 a, 8 b, 8 c having the same dataconsistent as of a same point-in-time when the I/Os to the first storagesystem 6 a was quiesced. The first server 4 a may then perform (at block274) the operations with respect to the subset of volumes in the firststorage system 6 a which were the subject of the point-in-time copies 8a, 8 b, 8 c. In described embodiments, the point-in-time copies mayoccur without disrupting the mirroring of data to volumes, other thanthe subset of volumes subject to the quiesce, to the second 6 b andthird 6 c storages.

When issuing the second and/or third commands to the second 4 b and/orthird 4 c servers, the first server 4 a may send just one command or maysend multiple point-in-time commands for the subset of volumes, such asone point-in-time command for each volume of the subset of volumes.

FIG. 7 illustrates an embodiment of operations performed in the recoverymodule 28 of the second server 4 b to handle a failure at the first 2 aand third 2 c sites. In one embodiment, if the production storage can berecovered from the third site 2 c, then that is preferred in embodimentswhere the third site 2 c is closer to the first site 2 a than the secondsite 2 b. Upon detecting (at block 300) the failure, the second server 4b performs (at block 302) a third point-in-time copy operation of allvolumes in the second storage system 6 b to a second storage system copyII 10. The second storage system copy II 10 may have more recent datamirrored to the second storage system 6 b, such as data mirrored afterquiescing I/Os to the subset of volumes at the first storage system 6 a,than the second storage system copy I 8 c, which is consistent as of thepoint-in-time of the quiescing at the first storage system 6 a. Torecover the production site from the second site 2 a, the second server4 b recreates (at block 304) the subset of volumes for production fromthe second storage system copy I 8 b and recreates (at block 306)volumes other than the subset of volumes, subject to the operation, suchas a batch operation, from the second storage system copy II 10 torecreate data mirrored after the quiesce operation.

With the operations of FIG. 7, the recovered production storage at thethird site 2 c has data in the subset of volumes subject to theoperation consistent as of the point-in-time before the event operation,such as a batch operation, is applied and has data for other volumesconsistent as of the last received mirror data set. Further, if there isa failure during the event operation, then the operations of FIG. 7 maybe performed to place the subset of volumes in the state before theevent operations began. After recovering the production data, the eventoperation may be performed on the recovered production volume.Additionally, the first server 4 a may create point-in-time copies atthe second 2 b and third 2 c sites after event operations completesuccessfully using the operations of FIGS. 5 and 6.

After recovering the production volume from the point-in-time copies 8 cand 10 at the third site 2 c, commands may be issued to make therecovered production volume the primary production volume to whichsystem requests are directed. Alternatively, the recovered productionvolumes may be copied to the production storage system 6 a.

In certain embodiments, the server may provide the ability, such as inthe copy manager, to extract a logical portion or portions of thecontents of the data from a physical volume image. (for exampleextracting a single data set/file, multiple data sets/files or all datasets/files from a physical image backup of the volume) Further, thedescribed embodiments may also compliment such functionality byproviding the ability to take volume images for any and all events, notjust locally, also at remote locations in an automated fashion.

In further embodiments, the point-in-time copy taken on the first siteand third site may be of a logical portion of the involved volumes suchas a file, dataset or other logical entity. The copy taken at the thirdsite might however be a full copy of the involved volumes and could usethe previously mentioned ability to extract the required logicalportions in the case of a recovery at the second site.

Described embodiments provide techniques to mirror data at multiplesites and create point-in-time copies at multiple sites to use in theevent of a failure at the primary site. The copy manager 6 programimplemented in the different servers 4 a, 4 b, 4 c may coordinate thecreation of the point-in-time copies to ensure they all are consistentas of the same time. Consistency is maintained even if certain of thesites have asynchronous data replication, such as at the second site 2b, where data updates may be behind in time with respect to the first 2a and third 2 c sites where mirroring of data is synchronous. Furtherdescribed embodiments provide coordination with a job scheduler or otherprogram applying updates to the production storage system 6 a to provideconsistent copies at all sites 2 a, 2 b, 2 c before a job is scheduled.

The described operations may be implemented as a method, apparatus orcomputer program product using standard programming and/or engineeringtechniques to produce software, firmware, hardware, or any combinationthereof. Accordingly, aspects of the embodiments may take the form of anentirely hardware embodiment, an entirely software embodiment (includingfirmware, resident software, micro-code, etc.) or an embodimentcombining software and hardware aspects that may all generally bereferred to herein as a “circuit,” “module” or “system.” Furthermore,aspects of the embodiments may take the form of a computer programproduct embodied in one or more computer readable medium(s) havingcomputer readable program code embodied thereon.

Any combination of one or more computer readable medium(s) may beutilized. The computer readable medium may be a computer readable signalmedium or a computer readable storage medium. A computer readablestorage medium may be, for example, but not limited to, an electronic,magnetic, optical, electromagnetic, infrared, or semiconductor system,apparatus, or device, or any suitable combination of the foregoing. Morespecific examples (a non-exhaustive list) of the computer readablestorage medium would include the following: an electrical connectionhaving one or more wires, a portable computer diskette, a hard disk, arandom access memory (RAM), a read-only memory (ROM), an erasableprogrammable read-only memory (EPROM or Flash memory), an optical fiber,a portable compact disc read-only memory (CD-ROM), an optical storagedevice, a magnetic storage device, or any suitable combination of theforegoing. In the context of this document, a computer readable storagemedium may be any tangible medium that can contain or store a programfor use by or in connection with an instruction execution system,apparatus, or device.

A computer readable signal medium may include a propagated data signalwith computer readable program code embodied therein, for example, inbaseband or as part of a carrier wave. Such a propagated signal may takeany of a variety of forms, including, but not limited to,electro-magnetic, optical, or any suitable combination thereof. Acomputer readable signal medium may be any computer readable medium thatis not a computer readable storage medium and that can communicate,propagate, or transport a program for use by or in connection with aninstruction execution system, apparatus, or device.

Program code embodied on a computer readable medium may be transmittedusing any appropriate medium, including but not limited to wireless,wireline, optical fiber cable, RF, etc., or any suitable combination ofthe foregoing.

Computer program code for carrying out operations for aspects of thepresent invention may be written in any combination of one or moreprogramming languages, including an object oriented programming languagesuch as Java, Smalltalk, C++ or the like and conventional proceduralprogramming languages, such as the “C” programming language or similarprogramming languages. The program code may execute entirely on theuser's computer, partly on the user's computer, as a stand-alonesoftware package, partly on the user's computer and partly on a remotecomputer or entirely on the remote computer or server. In the latterscenario, the remote computer may be connected to the user's computerthrough any type of network, including a local area network (LAN) or awide area network (WAN), or the connection may be made to an externalcomputer (for example, through the Internet using an Internet ServiceProvider).

Aspects of the present invention are described above with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems) and computer program products according to embodiments of theinvention. It will be understood that each block of the flowchartillustrations and/or block diagrams, and combinations of blocks in theflowchart illustrations and/or block diagrams, can be implemented bycomputer program instructions. These computer program instructions maybe provided to a processor of a general purpose computer, specialpurpose computer, or other programmable data processing apparatus toproduce a machine, such that the instructions, which execute via theprocessor of the computer or other programmable data processingapparatus, create means for implementing the functions/acts specified inthe flowchart and/or block diagram block or blocks.

These computer program instructions may also be stored in a computerreadable medium that can direct a computer, other programmable dataprocessing apparatus, or other devices to function in a particularmanner, such that the instructions stored in the computer readablemedium produce an article of manufacture including instructions whichimplement the function/act specified in the flowchart and/or blockdiagram block or blocks.

The computer program instructions may also be loaded onto a computer,other programmable data processing apparatus, or other devices to causea series of operational steps to be performed on the computer, otherprogrammable apparatus or other devices to produce a computerimplemented process such that the instructions which execute on thecomputer or other programmable apparatus provide processes forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks.

The terms “an embodiment”, “embodiment”, “embodiments”, “theembodiment”, “the embodiments”, “one or more embodiments”, “someembodiments”, and “one embodiment” mean “one or more (but not all)embodiments of the present invention(s)” unless expressly specifiedotherwise.

The terms “including”, “comprising”, “having” and variations thereofmean “including but not limited to”, unless expressly specifiedotherwise.

The enumerated listing of items does not imply that any or all of theitems are mutually exclusive, unless expressly specified otherwise.

The terms “a”, “an” and “the” mean “one or more”, unless expresslyspecified otherwise.

Devices that are in communication with each other need not be incontinuous communication with each other, unless expressly specifiedotherwise. In addition, devices that are in communication with eachother may communicate directly or indirectly through one or moreintermediaries.

A description of an embodiment with several components in communicationwith each other does not imply that all such components are required. Onthe contrary a variety of optional components are described toillustrate the wide variety of possible embodiments of the presentinvention.

Further, although process steps, method steps, algorithms or the likemay be described in a sequential order, such processes, methods andalgorithms may be configured to work in alternate orders. In otherwords, any sequence or order of steps that may be described does notnecessarily indicate a requirement that the steps be performed in thatorder. The steps of processes described herein may be performed in anyorder practical. Further, some steps may be performed simultaneously.

When a single device or article is described herein, it will be readilyapparent that more than one device/article (whether or not theycooperate) may be used in place of a single device/article. Similarly,where more than one device or article is described herein (whether ornot they cooperate), it will be readily apparent that a singledevice/article may be used in place of the more than one device orarticle or a different number of devices/articles may be used instead ofthe shown number of devices or programs. The functionality and/or thefeatures of a device may be alternatively embodied by one or more otherdevices which are not explicitly described as having suchfunctionality/features. Thus, other embodiments of the present inventionneed not include the device itself.

The illustrated operations of FIGS. 4-7 show certain events occurring ina certain order. In alternative embodiments, certain operations may beperformed in a different order, modified or removed. Moreover, steps maybe added to the above described logic and still conform to the describedembodiments. Further, operations described herein may occur sequentiallyor certain operations may be processed in parallel. Yet further,operations may be performed by a single processing unit or bydistributed processing units.

The foregoing description of various embodiments of the invention hasbeen presented for the purposes of illustration and description. It isnot intended to be exhaustive or to limit the invention to the preciseform disclosed. Many modifications and variations are possible in lightof the above teaching. It is intended that the scope of the invention belimited not by this detailed description, but rather by the claimsappended hereto. The above specification, examples and data provide acomplete description of the manufacture and use of the composition ofthe invention. Since many embodiments of the invention can be madewithout departing from the spirit and scope of the invention, theinvention resides in the claims herein after appended.

1. A computer program product for maintaining copies of data in a firststorage system at a first site and a second storage system at a secondsite, the computer program product comprising at least one computerreadable storage medium including first program code and second programcode, wherein the first program code is executed in a first server atthe first site and the second program code is executed at a secondserver at the second site, wherein the first program code and secondprogram code execute to perform operations, the operations comprising:maintaining, by the first program code, a mirror copy relationshipbetween the first storage system and the second storage system to mirrordata at the first storage system to the second storage system, whereinthe first and second sites connect over a network; performing, by thefirst program code, a first point-in-time copy operation from the firststorage system to a first storage system copy, wherein the data for thefirst storage system copy is consistent as of the determinedpoint-in-time, wherein the point-in-time copy operation is completedupon creating a data structure indicating data for the first storagesystem copy as located at the first storage system or the first storagesystem copy; transmitting, by the first program code, a command to thesecond server to create a point-in-time copy of the second storagesystem; processing, by the second program code, mirror data transferredfrom the first server as part of the mirror copy relationship todetermine when to create a second point-in-time copy from the secondstorage system to a second storage system copy in response to thecommand; and performing, by the second program code, the secondpoint-in-time copy operation, in response to determining from the mirrordata to create the second point-in-time copy.
 2. The computer programproduct of claim 1, wherein the first point-in-time copy operationcopies a subset of volumes in the first storage system to the firststorage system copy, wherein the command to create the point-in-timecopy indicates the subset of volumes, and wherein the secondpoint-in-time copy operation copies the indicated subset of volumes inthe second storage system to the second storage system copy.
 3. Thecomputer program product of claim 1, wherein the operations furthercomprise: transmitting, by the second program code, a complete messageto the first server in response to completing the second point-in-timecopy operation; and initiating, by the first program code, a requestedoperation in response to receiving the complete message from the secondserver and completing the first point-in-time copy operation.
 4. Thecomputer program product of claim 3, wherein the first server performsthe first point-in-time copy operation and transmits the command tocreate the point-in-time copy in response to receiving the requestedoperation.
 5. The computer program product of claim 4, wherein therequested operation is performed in response to an event, wherein therequested operation for the event processes a subset of volumes in thefirst storage system, wherein the first point-in-time copy operationcopies the subset of volumes in the first storage system to the firststorage system copy, wherein the command to create the point-in-timecopy indicates the subset of volumes, and wherein the secondpoint-in-time copy operation copies the indicated subset of volumes inthe second storage system to the second storage system copy.
 6. Thecomputer program product of claim 3, wherein the operations furthercomprise: quiescing, by the first program code, I/O operations at thefirst storage system, wherein the operations of performing the firstpoint-in-time copy operation and transmitting the command to create thepoint-in-time copy are performed after the quiescing of I/O operations;and allowing, by the first program code, I/O operations to proceedagainst the first storage system in response to receiving the completemessage and completing the first storage system copy.
 7. The computerprogram product of claim 1, wherein the operations further comprise:determining, by the first program code, a timestamp, wherein the firststorage system copy is consistent as of the determined timestamp, andwherein the command to create the second point-in-time copy indicatesthe determined timestamp; and wherein the processing, by the secondprogram code, of the mirror data to determine whether to create thesecond point-in-time copy comprises determining whether a timestampassociated with the mirror data applied to the second storage system isequal to the timestamp indicated in the command, wherein the secondpoint-in-time copy operation is performed in response to determiningthat the timestamp of the mirror data is equal to the timestampindicated in the command.
 8. The computer program product of claim 1,wherein mirror data from the first storage system to the second storagesystem is sent in consistency groups having data that is consistent asof a point in time, wherein the processing, by the second program code,of the mirror data to determine whether to create the secondpoint-in-time copy comprises determining whether a predetermined numberof consistency groups of mirror data from the first server have beenapplied to the second storage system after receiving the command,wherein the second point-in-time copy operation is performed in responseto applying the predetermined number of consistency groups to the secondstorage system after the receiving of the command.
 9. The computerprogram product of claim 1, wherein the first server is further incommunication with a third server and third storage system at a thirdsite, wherein the command comprises a first command and the mirror copyrelationship between the first storage system and the second storagesystem comprises a first mirror copy relationship, wherein the at leastone computer readable storage media further includes third program codeexecuted in the third server, wherein the first program code and thirdprogram code are executed to perform operations, the operationscomprising: maintaining, by the first program code, a second mirror copyrelationship between the first storage system and the third storagesystem to mirror data at the first storage system to the third storagesystem, wherein the first and the third sites connect over a network,and wherein the third site is more geographically proximate to the firstsite than the second site; transmitting, by the first program code, asecond command to the third server at the third site to create apoint-in-time copy; and performing, by the third program code, a thirdpoint-in-time copy operation from the third storage system to a thirdstorage system copy in response to the second command.
 10. The computerprogram product of claim 9, wherein the first storage system copy,second storage system copy, and third storage system copy have same dataconsistent as of a same point-in-time.
 11. The computer program productof claim 1, wherein the first and second storage systems are comprisedof volumes, wherein the operations further comprise: detecting, by thesecond computer code, a failure at the first site; performing, by thesecond program code, a third point in time copy operation, in responseto detecting the failure at the first site, from the second storagesystem to a third storage system copy, wherein the third storage systemcopy has data dated later than data in the second storage system copy,and wherein the second storage system copy includes a subset of volumesin the third storage system copy; and recreating, by the first programcode, a recreated first storage system previously at the failed firstsite from the second and third storage system copies.
 12. The computerprogram product of claim 11, wherein the first storage system copy andsecond storage system copy comprise point-in-time copies of a subset ofvolumes in the first and second storage systems, respectively, whereinrecreating the recreated first storage system from the second and thirdstorage system copies comprises: recreating the subset of volumes in thesecond storage system from the second storage system copy; andrecreating volumes other than the subset of volumes from the thirdstorage system copy.
 13. A system comprising a first server at a firstsite coupled to a first storage system at the first site and a secondserver at a second site, wherein the first and second sites connect overa network, wherein the second server is coupled to a second storagesystem at the second site, comprising: a processor; and a computerreadable storage medium having code executed by the processor to performoperations, the operations comprising: maintaining a mirror copyrelationship in the computer readable storage medium between the firststorage system and the second storage system to mirror data at the firststorage system to the second storage system; performing a firstpoint-in-time copy operation from the first storage system to a firststorage system copy, wherein the data for the first storage system copyis consistent as of the determined point-in-time, wherein thepoint-in-time copy operation is completed upon creating a data structureindicating data for the first storage system copy as located at thefirst storage system or the first storage system copy; transmitting acommand to the second server to create a point-in-time copy of thesecond storage system, wherein the command causes the second server toprocess mirror data transferred from the first server as part of themirror copy relationship to determine when to create a secondpoint-in-time copy from the second storage system to a second storagesystem copy in response to the command, and wherein the second serverperforms the second point-in-time copy operation in response todetermining from the mirror data to create the second point-in-timecopy.
 14. The system of claim 13, wherein the operations furthercomprise: receiving from the second server a complete message indicatingthat the second server completed the second point-in-time copyoperation; and initiating a requested operation in response to receivingthe complete message from the second server and completing the firstpoint-in-time copy operation.
 15. The system of claim 14, wherein theoperations further comprise: quiescing I/O operations at the firststorage system, wherein the operations of performing the firstpoint-in-time copy operation and transmitting the command to create thepoint-in-time copy are performed after the quiescing of I/O operations;and allowing I/O operations to proceed against the first storage systemin response to receiving the complete message and completing the firststorage system copy.
 16. The system of claim 13, wherein the operationsfurther comprise: determining a timestamp, wherein the first storagesystem copy is consistent as of the determined timestamp, and whereinthe command to create the second point-in-time copy indicates thedetermined timestamp, and wherein the second server processes the mirrordata to determine whether to create the second point-in-time copy bydetermining whether a timestamp associated with the mirror data appliedto the second storage system is equal to the timestamp indicated in thecommand, wherein the second point-in-time copy operation is performed inresponse to determining that the timestamp of the mirror data is equalto the timestamp indicated in the command.
 17. The system of claim 13,wherein mirror data from the first storage system to the second storagesystem is sent in consistency groups having data that is consistent asof a point in time, and wherein the second server processes the mirrordata to determine whether to create the second point-in-time copy bydetermining whether a predetermined number of consistency groups ofmirror data from the first server have been applied to the secondstorage system after receiving the command, wherein the secondpoint-in-time copy operation is performed in response to applying thepredetermined number of consistency groups to the second storage systemafter the receiving of the command.
 18. A system in communication with anetwork, comprising, a first server and a first storage system at afirst site; a second server and a second storage system at a secondsite; a third server and a third storage system at a third site, whereinthe first server, second server, and third server communicate over thenetwork; wherein the first server includes first program code executedto perform operations, the operations comprising: maintaining a firstmirror copy relationship between the first storage system and the secondstorage system to mirror data at the first storage system to the secondstorage system; maintaining a second mirror copy relationship betweenthe first storage system and the third storage system to mirror data atthe first storage system to the third storage system, wherein the thirdsite is more geographically proximate to the first site than the secondsite; performing a first point-in-time copy operation from the firststorage system to a first storage system copy, wherein the data for thefirst storage system copy is consistent as of the determinedpoint-in-time, wherein the point-in-time copy operation is completedupon creating a data structure indicating data for the first storagesystem copy as located at the first storage system or the first storagesystem copy; transmitting a first command to the second server to createa point-in-time copy of the second storage system; transmitting a secondcommand to a third server at the third site to create a point-in-timecopy; and wherein the second server includes second program codeexecuted to perform operations, the operations comprising: processingmirror data transferred from the first server as part of the mirror copyrelationship to determine when to create a second point-in-time copyfrom the second storage system to a second storage system copy inresponse to the first command; and performing the second point-in-timecopy operation, in response to determining from the mirror data tocreate the second point-in-time copy; wherein the third server includesthird program code executed to perform a third point-in-time copyoperation from the third storage system to a third storage system copyin response to the second command.
 19. A method, comprising:maintaining, by a first server, a mirror copy relationship in a computerreadable storage medium between a first storage system at a first siteand a second storage system at a second site to mirror data at the firststorage system to the second storage system, wherein the first andsecond sites connect over a network; performing, by the first server, afirst point-in-time copy operation from the first storage system to afirst storage system copy, wherein the data for the first storage systemcopy is consistent as of the determined point-in-time, wherein thepoint-in-time copy operation is completed upon creating a data structureindicating data for the first storage system copy as located at thefirst storage system or the first storage system copy; transmitting, bythe first server, a command to a second server to create a point-in-timecopy of the second storage system; processing, by the second server,mirror data transferred from the first server as part of the mirror copyrelationship to determine when to create a second point-in-time copyfrom the second storage system to a second storage system copy inresponse to the command; and performing, by the second server, thesecond point-in-time copy operation, in response to determining from themirror data to create the second point-in-time copy.
 20. The method ofclaim 19, further comprising: transmitting, by the second server, acomplete message to the first server in response to completing thesecond point-in-time copy operation; and initiating, by the firstserver, a requested operation in response to receiving the completemessage from the second server and completing the first point-in-timecopy operation.
 21. The method of claim 20, further comprising:quiescing, by the first server, I/O operations at the first storagesystem, wherein the operations of performing the first point-in-timecopy operation and transmitting the command to create the point-in-timecopy are performed after the quiescing of I/O operations; and allowing,by the first server, I/O operations to proceed against the first storagesystem in response to receiving the complete message and completing thefirst storage system copy.
 22. The method of claim 19, furthercomprising: determining, by the first server, a timestamp, wherein thefirst storage system copy is consistent as of the determined timestamp,and wherein the command to create the second point-in-time copyindicates the determined timestamp; and wherein the processing, by thesecond server, of the mirror data to determine whether to create thesecond point-in-time copy comprises determining whether a timestampassociated with the mirror data applied to the second storage system isequal to the timestamp indicated in the command, wherein the secondpoint-in-time copy operation is performed in response to determiningthat the timestamp of the mirror data is equal to the timestampindicated in the command.
 23. The method of claim 19, wherein mirrordata from the first storage system to the second storage system is sentin consistency groups having data that is consistent as of a point intime, wherein the processing, by the second server, of the mirror datato determine whether to create the second point-in-time copy comprisesdetermining whether a predetermined number of consistency groups ofmirror data from the first server have been applied to the secondstorage system after receiving the command, wherein the secondpoint-in-time copy operation is performed in response to applying thepredetermined number of consistency groups to the second storage systemafter the receiving of the command.
 24. The method of claim 19, whereinthe first storage system is further in communication with a third serverand third storage system at a third site, wherein the command comprisesa first command and the mirror copy relationship between the firststorage system and the second storage system comprises a first mirrorcopy relationship, further comprising: maintaining, by the first server,a second mirror copy relationship between the first storage system andthe third storage system to mirror data at the first storage system tothe third storage system, wherein the first and the third sites connectover a network, and wherein the third site is more geographicallyproximate to the first site than the second site; transmitting, by thefirst server, a second command to a third server at the third site tocreate a point-in-time copy; and performing, by the third server, athird point-in-time copy operation from the third storage system to athird storage system copy in response to the second command.